Use base-e (not 10) in expected_win_probability #54

dexonsmith · 2024-01-11T21:12:24Z

Change expected_win_probability to use base-e (instead of base-10), to match Glicko-2. v5_glicko2_update was already (correctly) using base-e.

This follow-up to 87c3fa8 gets handles the math domain error from tallying results on large small board handicaps. The base-10 bug made the expected win rate 1.0 (100%).

Of course, it's still possible to skip large handicap games when tallying results... but no longer necessary.

Relates to #45.

Change `expected_win_probability` to use base-e (instead of base-10), to match Glicko-2. `v5_glicko2_update` was already (correctly) using base-e. This follow-up to 87c3fa8 gets handles the math domain error from tallying results on large small board handicaps. The base-10 bug made the expected win rate 1.0 (100%). Of course, it's still possible to skip large handicap games when tallying results... but no longer necessary.

dexonsmith · 2024-01-11T21:14:27Z

goratings/math/glicko2.py

-                    / 400
-                )
-            )
+                    / 400)


@anoek, I'm skeptical of this / 400. Any thoughts on why it's here?

dexonsmith · 2024-01-11T21:21:02Z

@anoek, this function expected_win_probability shows up in other repos, with the same bug, but aside from unit tests it doesn't seem to be USED by anything (except prediction_cost in this, the goratings repo).

Let me know if/when to send out parallel PRs. Personally I'd be tempted to delete it from the other repos since it's not used there but up to you.

anoek · 2024-01-11T21:39:04Z

Ah, so the source of confusion is that this function is using the original glicko expected outcome formula on page 5 of http://www.glicko.net/glicko/glicko.pdf , as opposed to from http://glicko.net/glicko/glicko2.pdf

The original formula was this:

so that's where the 10** and / 400 come from

Note that the actual glicko2 implementation uses the E as defined by the glicko2 paper, this method was just used for that predicted outcome tally.

I'm guessing what happened is I did the original implemenation, came back to it a year or two later, thought "Hey I want to look at the estimated outcome", searched the glicko2 paper and didn't find the keywords I wanted and didn't pause long enough to think about the math, meanwhile the original glicko paper had a function that sounded like what I wanted, and given my understanding that glicko and glicko2 are completely compatible with one another, just glicko2 updates faster/better, I probably said to myself, this should work just fine for crunching some simple analytics. And I assume it does, though I assume using E from the glicko2 paper would be an improvement.

anoek · 2024-01-11T21:59:25Z

@anoek, this function expected_win_probability shows up in other repos, with the same bug, but aside from unit tests it doesn't seem to be USED by anything (except prediction_cost in this, the goratings repo).

Let me know if/when to send out parallel PRs. Personally I'd be tempted to delete it from the other repos since it's not used there but up to you.

Yeah I reckon eliminating it is probably a good thing. Honestly I don't know that it's useful here either, it's not immediately obvious to me whether using the value like we do produces anything meaningful. Seems like if we do keep it, we should update it though.

dexonsmith · 2024-01-11T22:00:27Z

@anoek, this function expected_win_probability shows up in other repos, with the same bug, but aside from unit tests it doesn't seem to be USED by anything (except prediction_cost in this, the goratings repo).
Let me know if/when to send out parallel PRs. Personally I'd be tempted to delete it from the other repos since it's not used there but up to you.

Yeah I reckon eliminating it is probably a good thing. Honestly I don't know that it's useful here either, it's not immediately obvious to me whether using the value like we do produces anything meaningful. Seems like if we do keep it, we should update it though.

Well, it's used in goratings to compute the prediction_cost, which is the main thing you look at, so we can't eliminate it here...

dexonsmith · 2024-01-11T22:01:42Z

Ah, so the source of confusion is that this function is using the original glicko expected outcome formula on page 5 of http://www.glicko.net/glicko/glicko.pdf , as opposed to from http://glicko.net/glicko/glicko2.pdf

The original formula was this:

so that's where the 10** and / 400 come from

Note that the actual glicko2 implementation uses the E as defined by the glicko2 paper, this method was just used for that predicted outcome tally.

I'm guessing what happened is I did the original implemenation, came back to it a year or two later, thought "Hey I want to look at the estimated outcome", searched the glicko2 paper and didn't find the keywords I wanted and didn't pause long enough to think about the math, meanwhile the original glicko paper had a function that sounded like what I wanted, and given my understanding that glicko and glicko2 are completely compatible with one another, just glicko2 updates faster/better, I probably said to myself, this should work just fine for crunching some simple analytics. And I assume it does, though I assume using E from the glicko2 paper would be an improvement.

Aha. I should update it more fully then before pushing. I'll make this a draft for now.

anoek · 2024-01-11T22:06:10Z

Well, it's used in goratings to compute the prediction_cost, which is the main thing you look at, so we can't eliminate it here...

It's not though, the main thing I was looking at was the handicap stuff at the top, it's all about those black win rates being consistent - if you've those pegged then it means your rating to ranking map is good. Everything else is just to help validate what I was seeing, maybe develop some further intuitions, that sort of stuff. So in that regard, sure might as well update it, but the main thing is optimizing for ensuring that if we have a rank difference between players, that corresponds directly to how much of a handicap they should have.

dexonsmith · 2024-01-11T22:08:29Z

Well, it's used in goratings to compute the prediction_cost, which is the main thing you look at, so we can't eliminate it here...

It's not though, the main thing I was looking at was the handicap stuff at the top, it's all about those black win rates being consistent - if you've those pegged then it means your rating to ranking map is good. Everything else is just to help validate what I was seeing, maybe develop some further intuitions, that sort of stuff. So in that regard, sure might as well update it, but the main thing is optimizing for ensuring that if we have a rank difference between players, that corresponds directly to how much of a handicap they should have.

Oh, the black win rate. I misunderstood which table you were looking at :/. (My fault... you did say the FIRST table, but I somehow was looking at the second.)

dexonsmith · 2024-01-11T23:04:07Z

I think cc296a6 updates it correctly / the rest of the way. This makes the math domain error come back... so now I'm not sure it really makes a difference.

dexonsmith · 2024-01-11T23:08:58Z

Effectively replaced this with #57 for now.

dexonsmith · 2024-01-11T23:29:29Z

I think cc296a6 updates it correctly / the rest of the way. This makes the math domain error come back... so now I'm not sure it really makes a difference.

I'm going to close this one and re-open a clean PR to consider.

dexonsmith requested a review from anoek January 11, 2024 21:12

dexonsmith changed the title ~~Use base-e (not-10) in expected_win_probability~~ Use base-e (not 10) in expected_win_probability Jan 11, 2024

dexonsmith force-pushed the expected-win-probability-base-e branch from 170fee1 to 1141878 Compare January 11, 2024 21:12

dexonsmith commented Jan 11, 2024

View reviewed changes

dexonsmith marked this pull request as draft January 11, 2024 22:01

Finish updating expected_win_probability to match glicko2_update

cc296a6

dexonsmith mentioned this pull request Jan 11, 2024

When tallying, cap the expected win rate at 1:1M #57

Merged

dexonsmith changed the title ~~Use base-e (not 10) in expected_win_probability~~ Update math in 'expected_win_probability' to match 'glicko2_update' Jan 11, 2024

dexonsmith changed the title ~~Update math in 'expected_win_probability' to match 'glicko2_update'~~ Use base-e (not 10) in expected_win_probability Jan 11, 2024

dexonsmith closed this Jan 11, 2024

dexonsmith mentioned this pull request Jan 11, 2024

Update expected_win_probability to match math in glicko_update #58

Merged

dexonsmith deleted the expected-win-probability-base-e branch January 11, 2024 23:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use base-e (not 10) in expected_win_probability #54

Use base-e (not 10) in expected_win_probability #54

dexonsmith commented Jan 11, 2024 •

edited

Loading

dexonsmith Jan 11, 2024

dexonsmith commented Jan 11, 2024

anoek commented Jan 11, 2024

anoek commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

anoek commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

Use base-e (not 10) in expected_win_probability #54

Use base-e (not 10) in expected_win_probability #54

Conversation

dexonsmith commented Jan 11, 2024 • edited Loading

dexonsmith Jan 11, 2024

Choose a reason for hiding this comment

dexonsmith commented Jan 11, 2024

anoek commented Jan 11, 2024

anoek commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

anoek commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024

dexonsmith commented Jan 11, 2024 •

edited

Loading